Predicting Proteins Interactions from Protein Sequence Features using Support Vector Machines
نویسندگان
چکیده
Computational methods to predict proteinprotein interactions are becoming increasingly important. This is due to the fact that most of the interactions data have been identified by highthroughput technologies like the yeast two-hybrid system which are known to yield many false positives. In this paper we investigate the use of two protein sequence features, namely, domain structure and hydrophobicity properties. The support vector machines (SVM) has been used as a learning system to predict protein interactions based only on protein sequence features. Protein domain structure and hydrophobicity properties are used separately as the sequence feature. Both features achieved accuracy of about 80%. But domains structure had receiver operating characteristic (ROC) score of 0.8480, while hydrophobicity had ROC score of 0.8159. These results indicate that protein-protein interaction can be predicted from domain structure with relatively better accuracy than hydrophobicity.
منابع مشابه
Support Vector Machines for Predicting Protein-Protein Interactions
Protein-protein interactions play a crucial role in cellular processes. Although these interactions should be determined by stringent experiments and succeeded assays, these steps are time-consuming and labor-intensive. For this reason, many efforts have been made to predict unknown protein-protein interactions in silico from genomic and/or other information. One of the problems we have faced i...
متن کاملSupport vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence.
Classification of gene function remains one of the most important and demanding tasks in the post-genome era. Most of the current predictive computer methods rely on comparing features that are essentially linear to the protein sequence. However, features of a protein nonlinear to the sequence may also be predictive to its function. Machine learning methods, for instance the Support Vector Mach...
متن کاملPredicting cardiac arrhythmia on ECG signal using an ensemble of optimal multicore support vector machines
The use of artificial intelligence in the process of diagnosing heart disease has been considered by researchers for many years. In this paper, an efficient method for selecting appropriate features extracted from electrocardiogram (ECG) signals, based on a genetic algorithm for use in an ensemble multi-kernel support vector machine classifiers, each of which is based on an optimized genetic al...
متن کاملPrediction of Contact Maps Using Support Vector Machines
Contact map prediction is of great interests for its application in fold recognition and protein 3D structure determination. In particular, we focusd on predicting non-local interactions in this paper. We employed Support Vector Machines (SVMs) as the machine learning tool and incorporated AAindex to extract correlated mutation analysis (CMA) and sequence profile (SP) features. In addition, we ...
متن کاملComparison of Domain and Hydrophobicity Features for the Prediction of Protein-Protein Interactions using Support Vector Machines
The protein domain structure has been widely used as the most informative sequence feature to computationally predict protein-protein interactions. However, in a recent study, a research group has reported a very high accuracy of 94% using hydrophobicity feature. Therefore, in this study we compare and verify the usefulness of protein domain structure and hydrophobicity properties as the sequen...
متن کامل